Overview

Dataset Statistics

Number of Variables 23
Number of Rows 83784
Missing Cells 74835
Missing Cells (%) 3.9%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 28.9 MB
Average Row Size in Memory 361.2 B
Variable Types
  • Categorical: 5
  • Numerical: 17
  • DateTime: 1
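
The derived figures in this block can be re-derived from the raw counts. The sketch below is only a consistency check on the numbers printed above; it assumes that "MB" means 2**20 bytes, which the report does not state.

```python
# Re-derive the summary figures from the counts listed in the report.
n_variables = 23
n_rows = 83784
missing_cells = 74835

total_cells = n_variables * n_rows               # every (row, variable) pair
missing_pct = 100 * missing_cells / total_cells

# Assumption: "MB" is 2**20 bytes (the report does not say which unit it uses).
avg_row_bytes = 361.2
total_mb = avg_row_bytes * n_rows / 2**20

print(f"Missing cells: {missing_pct:.1f}%")      # 3.9%
print(f"Total size: {total_mb:.1f} MB")          # 28.9 MB
```

Both values agree with the report only under the 2**20 convention; with 10**6 bytes per MB the total would come out near 30.3.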

Dataset Insights

HourlyCloudAmount has 74699 (89.16%) missing values [Missing]
Angle is skewed [Skewed]
Capacity is skewed [Skewed]
ClearSkyIrradiance(kWh/m2) is skewed [Skewed]
Irradiance(kWh/m2) is skewed [Skewed]
HourlyCloudAmount is skewed [Skewed]
HourlyPrecipitation is skewed [Skewed]
Date has a high cardinality: 504 distinct values [High Cardinality]
Set has constant value "train" [Constant]
Set has constant length 5 [Constant Length]
Date has constant length 10 [Constant Length]
Angle has 26544 (31.68%) negatives [Negatives]
Angle has 10608 (12.66%) zeros [Zeros]
ClearSkyIrradiance(kWh/m2) has 40726 (48.61%) zeros [Zeros]
Irradiance(kWh/m2) has 42462 (50.68%) zeros [Zeros]
HourlyPrecipitation has 76477 (91.28%) zeros [Zeros]

Variables


Set

categorical

Approximate Distinct Count 1
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 5864880

Length

Mean 5
Standard Deviation 0
Median 5
Minimum 5
Maximum 5

Sample

1st row train
2nd row train
3rd row train
4th row train
5th row train

Letter

Count 418920
Lowercase Letter 418920
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • Set has words of constant length

ID

numerical

Approximate Distinct Count 3491
Approximate Unique (%) 4.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 1340544
Mean 1812.0115
Minimum 1
Maximum 3584
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • ID is skewed left (γ1 = -0.0274)

Quantile Statistics

Minimum 1
5-th Percentile 180
Q1 933
Median 1827
Q3 2701
95-th Percentile 3410
Maximum 3584
Range 3583
IQR 1768

Descriptive Statistics

Mean 1812.0115
Standard Deviation 1030.5792
Variance 1.0621e+06
Sum 1.5182e+08
Skewness -0.02739
Kurtosis -1.1857
Coefficient of Variation 0.5687
  • ID is not normally distributed (p-value 2.1613589899147092e-05)
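
The descriptive statistics are linked by simple identities: variance is the square of the standard deviation, the sum is the mean times the row count, and the coefficient of variation is the standard deviation divided by the mean. A minimal sketch using the ID figures above (the variable names are illustrative, and the row count is taken from the overview):

```python
# Cross-check the ID descriptive statistics against each other.
n_rows = 83784          # "Number of Rows" from the overview
mean = 1812.0115
std = 1030.5792

variance = std ** 2     # report: 1.0621e+06
total = mean * n_rows   # report: 1.5182e+08 (ID has no missing values)
cv = std / mean         # report: 0.5687

print(f"variance = {variance:.4e}")
print(f"sum      = {total:.4e}")
print(f"cv       = {cv:.4f}")
```

The same three identities apply to every numerical variable in the report, with the row count reduced by the number of missing values where there are any.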

Date

categorical

Approximate Distinct Count 504
Approximate Unique (%) 0.6%
Missing 0
Missing (%) 0.0%
Memory Size 6283800

Length

Mean 10
Standard Deviation 0
Median 10
Minimum 10
Maximum 10

Sample

1st row 2020-12-14
2nd row 2020-12-14
3rd row 2020-12-14
4th row 2020-12-14
5th row 2020-12-14

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 167568
Decimal Number 670272
  • Date has words of constant length
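
The labels in the Letter table match the names of Unicode general categories (Ll, Zs, Lu, Pd, Nd). Assuming that is how the report tallies characters, the Date totals follow from the sample value alone, because every value has the same 10-character shape. A rough sketch; `category_counts` is a hypothetical helper, not part of the reporting tool:

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count characters in `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)

n_rows = 83784
counts = category_counts("2020-12-14")   # sample value from the report

# Assumption: every row is formatted like the sample (length is constant at 10).
dash_total = counts["Pd"] * n_rows       # Dash Punctuation
digit_total = counts["Nd"] * n_rows      # Decimal Number

print(dash_total, digit_total)           # 167568 670272
```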

Lat

categorical

Approximate Distinct Count 9
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 5883120

Length

Mean 5.2177
Standard Deviation 0.4127
Median 5
Minimum 5
Maximum 6

Sample

1st row 25.11
2nd row 25.11
3rd row 25.11
4th row 25.11
5th row 25.11

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 353376

Lon

categorical

Approximate Distinct Count 8
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 5939400

Length

Mean 5.8894
Standard Deviation 0.3136
Median 6
Minimum 5
Maximum 6

Sample

1st row 121.26
2nd row 121.26
3rd row 121.26
4th row 121.26
5th row 121.26

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 409656

Angle

numerical

Approximate Distinct Count 11
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 1340544
Mean -20.2745
Minimum -160
Maximum 22
Zeros 10608
Zeros (%) 12.7%
Negatives 26544
Negatives (%) 31.7%
  • Angle is skewed left (γ1 = -1.7525)

Quantile Statistics

Minimum -160
5-th Percentile -160
Q1 -31
Median 1.76
Q3 4.63
95-th Percentile 22
Maximum 22
Range 182
IQR 35.63

Descriptive Statistics

Mean -20.2745
Standard Deviation 52.3203
Variance 2737.4097
Sum -1.6987e+06
Skewness -1.7525
Kurtosis 1.772
Coefficient of Variation -2.5806
  • Angle is not normally distributed (p-value 5.188722038837592e-10)
  • Angle has 14664 outliers
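
The report does not say how outliers are identified. The quantiles above are at least consistent with the common 1.5 x IQR (Tukey fence) rule, which is assumed in this sketch; `tukey_fences` is a hypothetical helper.

```python
def tukey_fences(q1, q3, k=1.5):
    """Return (lower, upper) fences; values outside them count as outliers."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr

# Angle quantiles from the report.
lower, upper = tukey_fences(q1=-31, q3=4.63)
minimum, maximum = -160, 22

has_low_outliers = minimum < lower     # True: -160 lies below the lower fence
has_high_outliers = maximum > upper    # False: 22 lies inside the upper fence

print(lower, upper, has_low_outliers, has_high_outliers)
```

Under this assumption all Angle outliers sit on the low side, which fits the strong left skew reported for the variable.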

Module

categorical

Approximate Distinct Count 4
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 6716976
  • The largest value (AUO PM060MW3 320W) is over 1.9 times larger than the second largest value (MM60-6RT-300)

Length

Mean 15.1702
Standard Deviation 2.293
Median 17
Minimum 12
Maximum 17

Sample

1st row MM60-6RT-300
2nd row MM60-6RT-300
3rd row MM60-6RT-300
4th row MM60-6RT-300
5th row MM60-6RT-300

Letter

Count 543072
Lowercase Letter 0
Space Separator 100176
Uppercase Letter 543072
Dash Punctuation 74976
Decimal Number 552792
  • The top 2 categories (AUO PM060MW3 320W, MM60-6RT-300) take over 50.0%

Capacity

numerical

Approximate Distinct Count 14
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 1340544
Mean 349.136
Minimum 99.2
Maximum 499.8
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Capacity is skewed left (γ1 = -0.4334)

Quantile Statistics

Minimum 99.2
5-th Percentile 99.2
Q1 246.4
Median 352
Q3 498.56
95-th Percentile 499.8
Maximum 499.8
Range 400.6
IQR 252.16

Descriptive Statistics

Mean 349.136
Standard Deviation 144.6514
Variance 20924.0396
Sum 2.9252e+07
Skewness -0.4334
Kurtosis -1.1392
Coefficient of Variation 0.4143
  • Capacity is not normally distributed (p-value 4.2902930672030256e-18)

CapacityFactor

numerical

Approximate Distinct Count 2992
Approximate Unique (%) 3.6%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 1340544
Mean 3.9202
Minimum 0.1346
Maximum 7.2713
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • CapacityFactor is skewed left (γ1 = -0.7819)

Quantile Statistics

Minimum 0.1346
5-th Percentile 0.9226
Q1 3.0925
Median 4.315
Q3 5.052
95-th Percentile 5.7205
Maximum 7.2713
Range 7.1367
IQR 1.9595

Descriptive Statistics

Mean 3.9202
Standard Deviation 1.4773
Variance 2.1823
Sum 328448.0668
Skewness -0.7819
Kurtosis -0.3276
Coefficient of Variation 0.3768
  • CapacityFactor has 24 outliers

ArrayRatio

numerical

Approximate Distinct Count 3482
Approximate Unique (%) 4.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 1340544
Mean 0.8246
Minimum 0.2679
Maximum 1.4106
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • ArrayRatio is skewed right (γ1 = 0.3402)

Quantile Statistics

Minimum 0.2679
5-th Percentile 0.6362
Q1 0.7574
Median 0.8162
Q3 0.8848
95-th Percentile 1.0346
Maximum 1.4106
Range 1.1426
IQR 0.1274

Descriptive Statistics

Mean 0.8246
Standard Deviation 0.1254
Variance 0.01573
Sum 69090.6194
Skewness 0.3402
Kurtosis 2.6799
Coefficient of Variation 0.1521
  • ArrayRatio is not normally distributed (p-value 3.6489996583251594e-05)
  • ArrayRatio has 5208 outliers

Generation(kWd)

numerical

Approximate Distinct Count 1952
Approximate Unique (%) 2.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 1340544
Mean 1349.3357
Minimum 17
Maximum 3280
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Generation(kWd) is skewed right (γ1 = 0.2953)

Quantile Statistics

Minimum 17
5-th Percentile 262
Q1 578
Median 1288
Q3 1976
95-th Percentile 2708
Maximum 3280
Range 3263
IQR 1398

Descriptive Statistics

Mean 1349.3357
Standard Deviation 789.8259
Variance 623824.9336
Sum 1.1305e+08
Skewness 0.2953
Kurtosis -1.0059
Coefficient of Variation 0.5853

Irradiance(kWd/m2)

numerical

Approximate Distinct Count 944
Approximate Unique (%) 1.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 1340544
Mean 4.8319
Minimum 0.1889
Maximum 8.0056
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Irradiance(kWd/m2) is skewed left (γ1 = -0.6696)

Quantile Statistics

Minimum 0.1889
5-th Percentile 1.0806
Q1 3.7167
Median 5.2361
Q3 6.2583
95-th Percentile 7.3111
Maximum 8.0056
Range 7.8167
IQR 2.5417

Descriptive Statistics

Mean 4.8319
Standard Deviation 1.8746
Variance 3.5142
Sum 404834.3333
Skewness -0.6696
Kurtosis -0.4322
Coefficient of Variation 0.388

Datetime

datetime

Approximate Distinct Count 12099.1984
Approximate Unique (%) 14.4%
Missing 0
Missing (%) 0.0%
Memory Size 670400
Minimum 2020-06-09 00:00:00
Maximum 2021-10-28 23:00:00
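
The fractional distinct count (12099.1984) suggests an estimate rather than an exact tally. A quick plausibility check, assuming hourly timestamps and using the 504 distinct dates reported for the Date variable:

```python
from datetime import datetime, timedelta

start = datetime(2020, 6, 9, 0)
end = datetime(2021, 10, 28, 23)

# Hourly slots between the minimum and maximum timestamps, inclusive.
span_hours = (end - start) // timedelta(hours=1) + 1

# Assumption: each of the 504 distinct dates contributes all 24 hours.
hours_from_dates = 504 * 24

print(span_hours, hours_from_dates)   # 12168 12096
```

The reported figure is within about 0.03% of 504 x 24 = 12096 and below the 12168 slots in the full span, so it is consistent with a few calendar days being absent from the data.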

ClearSkyIrradiance(kWh/m2)

numerical

Approximate Distinct Count 34625
Approximate Unique (%) 41.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 1340544
Mean 0.2718
Minimum 0
Maximum 0.9768
Zeros 40726
Zeros (%) 48.6%
Negatives 0
Negatives (%) 0.0%
  • ClearSkyIrradiance(kWh/m2) is skewed right (γ1 = 0.7916)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0.004617
Q3 0.6107
95-th Percentile 0.9228
Maximum 0.9768
Range 0.9768
IQR 0.6107

Descriptive Statistics

Mean 0.2718
Standard Deviation 0.3452
Variance 0.1191
Sum 22768.7959
Skewness 0.7916
Kurtosis -1.0182
Coefficient of Variation 1.2702
  • ClearSkyIrradiance(kWh/m2) is not normally distributed (p-value 6.095593375783368e-25)

Irradiance(kWh/m2)

numerical

Approximate Distinct Count 400
Approximate Unique (%) 0.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 1340544
Mean 0.1869
Minimum 0
Maximum 1.1806
Zeros 42462
Zeros (%) 50.7%
Negatives 0
Negatives (%) 0.0%
  • Irradiance(kWh/m2) is skewed right (γ1 = 1.3086)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0.3306
95-th Percentile 0.7972
Maximum 1.1806
Range 1.1806
IQR 0.3306

Descriptive Statistics

Mean 0.1869
Standard Deviation 0.274
Variance 0.07507
Sum 15659.9056
Skewness 1.3086
Kurtosis 0.4053
Coefficient of Variation 1.4659
  • Irradiance(kWh/m2) is not normally distributed (p-value 5.712359646609383e-25)
  • Irradiance(kWh/m2) has 3364 outliers

HourlyTemperature

numerical

Approximate Distinct Count 660
Approximate Unique (%) 0.8%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 1340544
Mean 25.0782
Minimum 5.5
Maximum 37.2
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • HourlyTemperature is skewed left (γ1 = -0.7834)

Quantile Statistics

Minimum 5.5
5-th Percentile 15.1
Q1 21.5
Median 26.7
Q3 29
95-th Percentile 31.7
Maximum 37.2
Range 31.7
IQR 7.5

Descriptive Statistics

Mean 25.0782
Standard Deviation 5.3438
Variance 28.5562
Sum 2.1012e+06
Skewness -0.7834
Kurtosis -0.1263
Coefficient of Variation 0.2131
  • HourlyTemperature is not normally distributed (p-value 0.007033127502325589)
  • HourlyTemperature has 664 outliers

HourlyHumidity

numerical

Approximate Distinct Count 118
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 1340544
Mean 77.9753
Minimum 19
Maximum 100
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • HourlyHumidity is skewed left (γ1 = -0.2842)

Quantile Statistics

Minimum 19
5-th Percentile 60
Q1 71
Median 78
Q3 85
95-th Percentile 96
Maximum 100
Range 81
IQR 14

Descriptive Statistics

Mean 77.9753
Standard Deviation 10.7887
Variance 116.3964
Sum 6.5331e+06
Skewness -0.2842
Kurtosis 0.2388
Coefficient of Variation 0.1384
  • HourlyHumidity is not normally distributed (p-value 0.008848881475692363)
  • HourlyHumidity has 772 outliers

HourlyWindSpeed

numerical

Approximate Distinct Count 289
Approximate Unique (%) 0.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 1340544
Mean 3.3699
Minimum 0
Maximum 15.7
Zeros 286
Zeros (%) 0.3%
Negatives 0
Negatives (%) 0.0%
  • HourlyWindSpeed is skewed right (γ1 = 1.1566)

Quantile Statistics

Minimum 0
5-th Percentile 0.4
Q1 1.3
Median 2.6
Q3 4.7
95-th Percentile 9
Maximum 15.7
Range 15.7
IQR 3.4

Descriptive Statistics

Mean 3.3699
Standard Deviation 2.6778
Variance 7.1705
Sum 282346.85
Skewness 1.1566
Kurtosis 0.8409
Coefficient of Variation 0.7946
  • HourlyWindSpeed is not normally distributed (p-value 0.008713182647389798)
  • HourlyWindSpeed has 2544 outliers

HourlyCloudAmount

numerical

Approximate Distinct Count 11
Approximate Unique (%) 0.1%
Missing 74699
Missing (%) 89.2%
Infinite 0
Infinite (%) 0.0%
Memory Size 145360
Mean 6.2295
Minimum 0
Maximum 10
Zeros 908
Zeros (%) 1.1%
Negatives 0
Negatives (%) 0.0%
  • HourlyCloudAmount is skewed left (γ1 = -0.5284)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 3
Median 7
Q3 9
95-th Percentile 10
Maximum 10
Range 10
IQR 6

Descriptive Statistics

Mean 6.2295
Standard Deviation 3.4856
Variance 12.1493
Sum 56595
Skewness -0.5284
Kurtosis -1.1428
Coefficient of Variation 0.5595
  • HourlyCloudAmount is not normally distributed (p-value 3.5166297293594995e-12)

HourlyPrecipitation

numerical

Approximate Distinct Count 11
Approximate Unique (%) 0.0%
Missing 136
Missing (%) 0.2%
Infinite 0
Infinite (%) 0.0%
Memory Size 1338368
Mean 0.04013
Minimum 0
Maximum 1
Zeros 76477
Zeros (%) 91.3%
Negatives 0
Negatives (%) 0.0%
  • HourlyPrecipitation is skewed right (γ1 = 4.6393)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 0.3
Maximum 1
Range 1
IQR 0

Descriptive Statistics

Mean 0.04013
Standard Deviation 0.1629
Variance 0.02655
Sum 3356.7
Skewness 4.6393
Kurtosis 21.5097
Coefficient of Variation 4.0604
  • HourlyPrecipitation is not normally distributed (p-value 4.468443274101436e-25)
  • HourlyPrecipitation has 7171 outliers
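
With Q1 = Q3 = 0 the IQR is zero, so under a 1.5 x IQR rule (an assumption; the report does not name its outlier rule) both fences collapse to 0 and every non-zero, non-missing value is flagged. The counts above bear this out:

```python
n_rows = 83784      # "Number of Rows" from the overview
missing = 136
zeros = 76477

q1 = q3 = 0.0
iqr = q3 - q1
lower, upper = q1 - 1.5 * iqr, q3 + 1.5 * iqr   # both fences equal 0.0

# No negatives are reported, so everything left over lies above the upper fence.
nonzero = n_rows - missing - zeros

print(lower, upper, nonzero)   # 0.0 0.0 7171
```

The result equals the reported outlier count, so the "outliers" here are simply the hours with any recorded precipitation.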

Hour

numerical

Approximate Distinct Count 24
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 1340544
Mean 11.5
Minimum 0
Maximum 23
Zeros 3491
Zeros (%) 4.2%
Negatives 0
Negatives (%) 0.0%

Quantile Statistics

Minimum 0
5-th Percentile 1
Q1 5.75
Median 11.5
Q3 17.25
95-th Percentile 22
Maximum 23
Range 23
IQR 11.5

Descriptive Statistics

Mean 11.5
Standard Deviation 6.9222
Variance 47.9172
Sum 963516
Skewness 0
Kurtosis -1.2042
Coefficient of Variation 0.6019
  • Hour is not normally distributed (p-value 8.530609293613251e-198)
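
These figures are what a perfectly balanced hour-of-day column would produce. The sketch below assumes each hour 0..23 occurs equally often and that the report uses the sample (N - 1) variance; neither is stated in the report.

```python
n_rows = 83784
hours = list(range(24))

mean = sum(hours) / len(hours)                              # 11.5
pvar = sum((h - mean) ** 2 for h in hours) / len(hours)     # population variance
sample_var = pvar * n_rows / (n_rows - 1)                   # N - 1 correction

# Excess kurtosis of the discrete uniform distribution on 0..23.
m4 = sum((h - mean) ** 4 for h in hours) / len(hours)
excess_kurtosis = m4 / pvar ** 2 - 3

rows_per_hour = n_rows // 24                                # 3491

print(f"{mean} {sample_var:.4f} {excess_kurtosis:.4f} {rows_per_hour}")
```

The 3491 rows per hour equal both the reported zero count for Hour and the approximate distinct count of ID, which suggests one ID per site-day with 24 hourly rows each.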

DayOfYear

numerical

Approximate Distinct Count 366
Approximate Unique (%) 0.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 1340544
Mean 0.5294
Minimum 0.002732
Maximum 1
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • DayOfYear is skewed left (γ1 = -0.3228)

Quantile Statistics

Minimum 0.002732
5-th Percentile 0.07104
Q1 0.3579
Median 0.5574
Q3 0.7295
95-th Percentile 0.9071
Maximum 1
Range 0.9973
IQR 0.3716

Descriptive Statistics

Mean 0.5294
Standard Deviation 0.2484
Variance 0.06168
Sum 44355.6066
Skewness -0.3228
Kurtosis -0.734
Coefficient of Variation 0.4691
  • DayOfYear is not normally distributed (p-value 0.003033092568465965)
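
The minimum of 0.002732 equals 1/366 and there are 366 distinct values, so DayOfYear looks like the calendar day number divided by 366. That scaling is inferred from the statistics and is not documented in the report; `day_of_year_fraction` is a hypothetical helper.

```python
from datetime import date

def day_of_year_fraction(d, days_in_year=366):
    """Day number within the year (1-based) scaled to (0, 1]."""
    return d.timetuple().tm_yday / days_in_year

first = day_of_year_fraction(date(2020, 1, 1))     # matches the reported minimum
last = day_of_year_fraction(date(2020, 12, 31))    # matches the reported maximum
sample = day_of_year_fraction(date(2020, 12, 14))  # the sample Date value

print(f"{first:.6f} {last} {sample:.4f}")
```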

DayOfYearTransformed

numerical

Approximate Distinct Count 262
Approximate Unique (%) 0.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 1340544
Mean 0.5721
Minimum 0
Maximum 1
Zeros 168
Zeros (%) 0.2%
Negatives 0
Negatives (%) 0.0%
  • DayOfYearTransformed is skewed left (γ1 = -0.2424)

Quantile Statistics

Minimum 0
5-th Percentile 0.08197
Q1 0.3661
Median 0.5847
Q3 0.8142
95-th Percentile 0.9672
Maximum 1
Range 1
IQR 0.4481

Descriptive Statistics

Mean 0.5721
Standard Deviation 0.2727
Variance 0.07435
Sum 47935.3443
Skewness -0.2424
Kurtosis -0.9647
Coefficient of Variation 0.4766
  • DayOfYearTransformed is not normally distributed (p-value 0.005764334356381923)

Interactions

Correlations

Missing Values